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The Invention provides DNA sequences encoding novel members of the TGF £ family of proteins. The TGF-p* family com- 
plies proteins which function as growth and/or differentiation factors and which aic useful in medical applications. According* 
ly, the invention also desuibw the isolation of the above-mentioned DNA sequences, the expression of the encoded proteins, the 
production of said proteins and pharmaceutical compositions containing said proteins. 
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DKA Sequences Encoding Novel Growth/ 
Differentiation Foctore 

[ 

The present invention relates to DNA sequences encoding novel 
growth /differentiate on factors of the TGF-fi family. In 
particular , it relates to novel DNA sequences encoding TGF-9-. 
like proteins, to the isolation of said DNA sequences, to 
expression placmids containing said DNA, to microorganisms 
tranefonned by said expression plasmid, to the production of 
said protein by culturing said transf ormant, and to pharma- 
ceutical compositions containing said protein • The TGF-8 
family of growth factors comprising BMP, TGF, and inhibin 
relared proteins (Koberts and Sporn, Handbook of Experimental 
Pharmacology 95 (1990), 419-472) is of particular relevance 
in a wide range of medical treatments and applications. These 
| factors are useful in processes relating to wound healing and 
: ' tissue repair. Furthermore , several members of the TGF-3 

family are tissue inductive, especially osteo-inductive, and 
consequently play a crucial role in inducing cartilage and 
bone development. 

I Wozney, pLogLttsis in Growth Factor Research 1 (1989), 267~280 
i and Vale et al*, Handbook of Experimental pharmacology 95 
i (1990), 211-248 describe different growth factors such as 
' ' those relating to the BMP (bone morphogfinffti^ proteinc) and 
' the Tnhihin group. Th* members of these groups share 

significant structural similarity. The precursor of the 
-protein is composed of an amino terminal signal sequence, a 
propeptide and a carboxytcrminal sequence of about 110 
amino acids, which iff subsequently cleaved from the precursor 
and' represents the mature protein » Furthermore, their members 
are defined by virtue of amino acid sequence homology. The 
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nature protein containe the most conserved sequences/ 

i 

especially s von cystein rocidu s which arc conserved among 
the family members* The TGr-S-like proteins are 
multifunctional , hormonally active growth factors- They also 
share related biological activities such as chemotactic 
! attraction of cells , promoting cell differentiation and their 
i tissue-inducinq capacity, such as cartilage- and. bone- 
inducing capacity. U.S. Patent No. 5,013,649 discloses DNA 
sequences encoding osteo-inductive proteins termed BMP-2 
proteins (bone morphogenetic protein), and U.S» patent 
applications serial nos . 179 101 and 179 197 disclose the BMP 
proteins BMP-1 and. BMP— 3. Furthermore, many, cell types are 
\ able to synthesize TGF-B-lik'e proteins and virtually all 
cells possess TGF-0 receptors. 

Taken Logether, these proteins show differences in their 
structure, leading to considerable variation in their 
detailed biological function. Furthermore, they are found in 
a wide, variety of different tissues and developmental stages- 
1 Consequently, they might, possess differences concerning their 
| function in detail, for istance the required cellular 
j physiological environment, their lifespan, their targets, 

their requirement for accessory factorb, <md their resistance 
to degradation. Thus, although nuuiexous proteins exhibiting 
tissue-inductive, especially os Leo-inductive potential are 
described, their natural role in the organism and, more 
j importantly, their medical relevance must still be elucidated 
; in detail. The occurrence of still-unknown members of the 

i ^ 

! TGF-0 family relevant for osteogenesis or 

' dif f erentiation/ induct ion of other tissues is strongly 

suspected- However, a major problem in the isolation of these 
new TGF-FUlike proteins is that their functions cannot yet be 
described precisely enough for the design of a discriminative 
hioassay. On the other hand, the expected nucleotide sequence 
homology to known members of the family would be too low to 
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allow for screening by classical nucl ic acid hybridization, 
.techniques. Nevertheless , the further isolation and 
characterization of new TGF-8-like proteins is urgently 
' needed in order to get hold of the whole Bet of induction and 
differentiation proteins meeting all deaired medical 
requirements. These factors might find useful medical 
applications in defect healing and treatments of degenerative 
disorders of bone and/or other tissues like/ for example/ 
kidney and liver. 

Thus, the technical problem underlying the present invention 
essentially is to provide DNA sequences coding for new 
members of the TGF-fl protein family having mitogenic and/or 
differentiation- inductive, e.g. oeteo-inductive potential. 

Thc solution to the above technical problem is achieved by 
providing the embodiments characterized in claims 1 to 17. 
Other features and advantages of the invention will be 
apparent from the description of the preferred embodiments 
and the drawings. The sequence listings and drawings, will now 
'briefly be described. 

STO TP NO. 1 showc the nucleotide sequence of MP-S2, i.e. the 
embryo derived sequence corresponding to the uiaLure peptide 
and most of the sequence coding for the propeptide of HP-52. 

Some .of th<e propeptide sequence at the 5' -end of MP-52 has 
not been characterized so far. 

SEQ. ID HOi__2* shows the bo far characterized nucleotide 
sequence of the liver-dp.rived sequence MP- 121- 

SEO TP NO. 3 shows the amino acid sequence of MP-S2 as 
deduced from SEQ ID MO. 1. 
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yi^ure shows an alignment of the amino acid sequences ot 
MP-52 and HP- 121 with some related proteins, la shows the 
alignment of HP-52 with some members of the BMP protein 
family starting from the first of the ocven conserved 

i cysteine; lb shows the alignment of MF-121 with some members 
of the Inhibin protein family. * indicates that the amino 

1 acid is the same in all proteins compared; +■ indicates that 
the amino acid is the same in. at least one of the proteins 
compared with MP-52 (Fig. la) or KP-121 (Fig. lb.). ■ 

Figure 2 shows the nucleotide sequences of the oligo- 
nucleotide primer as used in the present invention- a*d an 
alignment of these oequencee with known members of the-/ TGF-G 
i family. M means A or C; S means C or C; R means K or G; and K 

means G or T. 2a depicts the sequence of the primer OD;; 2b 
shows the sequence of the primer OID.. 

The present invention relates to novel TGF-B-like proteins 
and provides dna sequences contained in the corresponding 
genes. Such sequences include nucleotide sequences 
comprising the sequence 

ATGAACTCCATGCACC-f-CGAGTCCACA and 

CTTC TC AAGGC C AAC AC AGC TGCAGGC ACC 
and in particular sequences as illustrated in SEQ ID Noc. 1 
and 2 T allelic derivatives of said sequences and DNA 
sequences degenerated as a result of the genetic code for 
said sequences* They also include DNA sequences hybridizing 
uudex, sLrinyeiit conditions with the DNA sequences mentioned 
above and containing the following amino acid sequences: 

rtet-Asn-ser-Met-Asp-Pro-Glu-Ser-Thr or 

Leu~Leu~Lys-Ala-Asn-Thr-Ala-Ala-Gly-Thr . 

Although said allelic, degenerate and hybridizing sequences 
may have structural divergencies duo to naturally occurring 
mutations, cuch as small deletions or substitutions, they 
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vill usually still exhibit essentially the same useful 
properties , allowing their use in basically the same medical 
applications . 

According to the present invention, the term "hybridization" 
means conventional hybridization conditions, preferably 
conditions with a salt concentration of 6 x SSC at 62° to 
66°C followed by a one-hour wash with 0.6 x SSC r 0,1% SDS at 
62° to 66°C. The term "hybridization" preferably refers to 
stringent hybridization conditioxie with a salt concentration 
of 4 x SSC at 62 C -66°C followed by a one-hour wash with 0.1 x 
SSC, 0.1% SDS at 62*-66*C. 

Important bioloqical activities of the encoded protein* 
comprise a mitogenic and osteoinductive potential and can be 
determined in assays amording to Roberts et al- r PNAS 78 
(1981), 5.139-5343, Seyedin et al., PNAS 82 (1985), 2267-2271 
or Sampath and Reddi f PNAS 78 (1981) , 7599-7603. 



Preferred embodiments of the present invention are DNA 
sequences as defined above and obtainable from vertebrates r 
preferably maxruoali* such as pig or cow and from rodents such 
as rat or mouse, and in particular from prijnates such as 
humans. 

Particularly preferred embodiments of the present invention 
are the DNA sequences termed MP-52 and HP- 121 which are shown 
in SEQ ID^-Noe- 1 and 2. The corresponding transcripts of MP- 
52 WATQ obtained from embryogenic tissue and code for a 
protein showing considerable amino acid homology to the 
mature part of the BMP-like proteins (see Fig. la). The 
protein eequences of BMP 2 (-BHP2A) and HMF4 (— BM£"2B ) are 
described in wozney et al., Science V 1 242 , 1528-1534 
(1988). The respective sequences of BMP5 , BMP6 and BMP? are 
described in Celeste et al., ProcNatl.Acad.Sci. USA Vol 87, 
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9843-9847 (1990}. Some typical sequence homologies , which are 
specific to known BMP-sequences only, were alBo found in the 
propeptide part of mf-52, whereas other parte of the 
precursor part of HP-52 show marked differences to BHP- 
prectirsors. The mRNA of MP-121 was detected in liver tissue, 
and its correspondig amino acid sequence shows homology to 
the amino acid sequences of tha Inhibin protein chai-ns (see 
Fig. lb). cDNA sequences encoding TGF-G-likc proteinc have 
not yet been isolated from liver tissue, probably due* to a 
low abundance ol TGF-B specific transcripts in this tissue. 
In enibryogenic tissue , however, sequences encoding known TGF- 
Q-like proteins can be found in relative abundance. The 
inventors have recently detected the presence of a collection 
of TGF-3-like proteins in liver as well. The high background 
level of clones related to kovn factors of th.i n group 
presents "the. main difficulty in establishing novel TGF-fl- 
related sequences from these and probably other tissues* In 
the present invention, the cloning was carried out according 
to the method described below. Once, the DNA sequence has been 
cloned, the preparation of host cells capable of producing 
the TGF-Q-like proteins and the production of said proteins 
can be easily accomplished using known recombinant una 
techniques comprising constructing the expression plasmids 
encoding said protein and transforming a host cell with said 
expression plasmid, cultivating the transformant vn a 
suitable culture medium, and recovering the product having 
TGF-Q-like_ activity. 

Thus, the invention also relates to recombinant molecules 
comprising DNA sequences as described above, optionally 
linked Uu air expression conLiul sequence. Such vectors may be 
useful in the production of TGF-3-like proteins in stably or 
transiently transformed cells, several animal, plant, fungal 
and bacterial systems may be employed for the transformation 
and subsequent cultivation process. Preferably, expression 
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vectors which can be used in the invention contain sequences 
necessary for the replication in the host call and arc 
autonomously replicable. It is also preferable to use. vectors 
containing selectable marker gene© which can be easily 
selected for transformed cells . The necessary operation is 
well-known to those skilled in the art. 

It is another object of the invention to provide a host cell 
transformed by an expression plasmid of the < invention and 
capable of producing a protein of the TGF-S family. Examples 
of suitable host cells include various eukaryotic and 
prokaryotic cells, such as B. coli, .insect cells, plant 
Cftlls, mammalian cells, and fungi such as yeast* 

Another object of the present invention is to provide a 
protain of the TGF-S family encoded by the DNA sequences 
described above and displaying biological features such ae 
tieoue-inductive, in particular osteo-inductive and/or 
aitogeuic capacities possibly relevant to therapeutical 
treatments. The above-mentioned features of the protein might 
vary depending upon the formation of homodimers or 
heterodimers. Such structures may prove useful in clinical 
applications an well. The amino acid sequence of an 
especially preferred protein of the TGF-Q -family (MP-52) is 
shown in SEQ ID NO- 3. 

It ia a further aspect of the invention to .provide a process 1 
for the production of TGi-B-liJce proteins. Such a process 
comprises cultivating a host cell being transformed with a 
DNA sequence of the present invp.ntion in a suitable culture 
medium and purifying the TGF-Q-like protein produced- Thus/ 
this process will allnw the production of a sufficient amount 
of the desired protein for use in medical treatments or ia 
applications using cell culture techniques requiring growth 
factors for their performance. The ho*t cell is obtainable 
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from bacteria such as Bacillus or Escherichia eoli, from 
fungi such as yeast, from plaaL» »uch as tobacco, potato, or 
1 Arabidopsis, and from animals, in particular vertebrate cell 
linee such as the Ho-, COS- or CHO cell line. 

Yet another aspect of the present invention is to provide a 
■ particularly sensitive process for the isolation of DNA 

sequences corresponding to low abundance mRNAs in the tissues, 
of interact. The proceco of the invention comprises the 
combination of four different steps. Fir.st, the mRNA has to 
be isolated and used in an amplification reaction using . 
; olignucleotide primers •* The sequence of the oligonucleotide 
primers contains degenerated DNA sequences derived from the 
! amino acid sequence of proteins related to the gene of 
! interest* This step may lead to the amplification 6f. already 
known members of the gene family of interest , and these . 
undesired sequences would therefore have to he eliminated* 
This object is achieved by using restriction endonucleasec 
which are known to digest the already-analyzcd members of the 
i gene family* After treatment of the. amplified DNA population 

with said restriction endonucleases, the remaining desired 
• DNA sequences are isolated by gel electrophoresis and 

reamplified in a third step by an amplification reaction-, and 
in a fourth step they are cloned into suitable vectors for 
sequencing. To increase the sensitivity and efficiency r steps 
two and three are repeatedly performed, at least two tijnes in 

! one embodiment of this process. 

i 

1 In a preferred embodiment, the isolation process described 

above is used for the isolation of DNA sequences from liver 
tissue • In a particularly preferred embodiment of the above- 
described process, one primer used for the PCK experiment is 
homologous to the polyA td.il of the mRHA, whereas the second 
priaiex contains a gene-specific sequence. The techniques .... 
employed in carrying out the different steps of this process 
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[such as amplification reactions or sequencing techniques) 
are known to the pernon skilled in the art and described, for 
instance, in. Sambrook et al., 1989/ "Molecular Cloning: A 
laboratory manual Cold Spring Harbor Laboratory Press. 

It is another objecl of the present invention to provide 
pharmaceutical compositions containing a therapeutically- 
effective amount of a protein of the TG7-8 family of the 
present invention. Optionally, Kuch a composition con^rises a 
pharmaceutical^ acceptable carrier- Such a therapeutic 
composition can be uced in wound healing and tissue repair as 
w^ll as in the healing of bone, cartilage, or tooth defects, 
either individually or in conjunction with suitable carriers, 
and poceibly with other related proteins or growth factors. 
Thus, a therapeutic composition of the invention may include, 
but is not limited to, the MP- 5 2 encoded protein in 
conjunction with the MP- 12 I encoded protein, and optionally 
with other known biologically-active substances cuch as egf 
(epidermal growth factor) or PDGF (platelet derived growth 
factor). Another possible clinical application of a. TGF-fi- 
like protein is the use as a suppressor of the inaauno 
response, which would prevent rejection of organ transplants. 
The pharmaceutical composition comprising the proteins of the 
invention can also be used prophylactically , or can be 
employed in cosmetic plastic surgery- Furthermore, the 
application of the composition is not limited to humans but - 
cau include animals, in particular domestic animals, ac 
well. \ 

Finally, another object of the present invention is an 
antibody nr antibody fragment, which is capable of 
specifically binding to the proteins of the present 
inv*ntion- Methods to raise such specific antibody are 
general knowledge- Preferably such an antibody is a 
monoclonal antibody. Such antibodies or antibody fragments 
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inight be useful for diagnostic methods* 

The following examples illustrate in detail the invention 
disclosed, but should not be construed as limiting the 
invention. 

Isolation of MP-121 

1.1 Total RNA was isolated from, human liver tiesue 
old-male) by the method of Chirgvin et aim, Biochemistry 
18 (1979), 5294-5299. Poly A* RNA was separated from 
total RNA by oligo (dT) chromatography according to the 
instructions of the manufacturer (Stratagene Poly. (A) 
Quick columns) . 

1.2 For the reverse transcription reaction, poly A* RNA (1- 
2-5 pg) derived from liver tissue was heated for 5 

.minutes to 65 °C and cooled rapidly on ice- The reverse 
transcription reagents containing 27 U RNA guard 
(rharmacia) , 2.5 jig oligo d(T) 12 -tfi (Pharmacia) 5 x 
buffer (250 mil Tris/HCl pH 8.5; 50 mM MgCl 2 ; 50 mM DTT; 
5 mM each dNTP; 600 mM KCli and 20 units avian 
myeloblastosis virus reverse transcriptase (AMV, 
Boehringer Mannheim) per yg poly (A*) RNA were added . 
The reaction mixture (25 //!) was incubated for 2 hours 
at 42°C. The liver nDNA pool was stored at -20 °C. 

1.3 The deoxynucleotide primer c OD and OID (Fig- 2) designed 
to prime the amplification reaction were generated on an 
automated DNA-synthesizer (Biosearch ) . Purification was 
done by denaturating polyacrylamide gel electrophoresis 
and isolation of the main band from the gel by 
isotachophoresis . The oligonucleotides were designed by 
aligning the nucleic acid sequences of some Known 
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members ot the TGF-6 family and selecting regions of the 
highest conservation. An alignment of ti« region ic 
shown in Fig. 2. In order to facilitate cloning , both 
oligonucleotides contained EcoR I restriction sites and 
OD additionally contained an Nco I restristion site at 
its 5' terminus* 

1.4 In the polymerase chain reaction, a liver-derived cDHA 
. pool was used as a template in a 50 pi reaction mixture. 
The amplification was performed in 1 x PCR-buffcr (IS -6 
mM (NH A ) 2 S0 A ; 67 TnM Tris/HCl pa 8,8; 2 mK MgCl 2 ; 6.7 fdf 
EDTA? 10 mM B-mercaptoQthanol; 170 ^g/ml BSA (Gibco)), 
200 fjM each dNTP (Pharmacia) , 30 pmol each 
oligonucleotide (OD and OID) and 1.5 units Taq 
polymerase (AmpliTaq, PerJtin Slmer Cetus ) , The PCR 
reaction contained cDNA corresponding to 30 ng of poly 
(A* ) RNA as staring material. The reaction mixture was 
overlayed by paraffine and 40 cycles (cycle Is 80s 
93°C/40s 52 w C/40s 72°C; cycles 2-9: SOs 93°C/40e 
52°C/40b 72°C; cycles 10-29: 60s 93°C/40e 52°C/60a 
72°C; cycles 30-31: SOs 93°C/40c 52°C/90s 72°C; cycle 
40: 60s 93°C/40s 52°C/420e 72°C) of the PCR were 
performed- Six PCR-reaction mixtures were pooled f 
purified by subsequent extractions with equal volumes Ot 
phenol, phenol/chlurof orm (1:1 (v/v)) and 
chlorofotWisoamylalcohol (24:1 (v/v)} and concentrated 
by eLhanol precipitation. 

1-5 One half of the obtained PCR pool vas sufficient for 

digeetidn with the restriction enzymes Sph I (Pharmacia) 
and AlwN I (Biolabfi). The second half was digested in a 
series of reactions by the restriction enzymes Ava I 
(BRL), AlwN I (Biolabe) and Tfi I (Biolabs). The 
restriction andonuclcasc digestions were performed in 
100 k1 at 37°C (except Tfi I at G5 rt C) utfiug 8 units of 
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each enzyme in a 2- to 12-hour reaction in a buffer 
recommended by the manvif acturer . 

1.6 Each DHA sample was fractioned by electrophoresis using 
a 4% agarose gel (3% FMC Nusieve agarose r Biozym and. 1% 
agarose, BRL) in Tris borate buffer (89 mM Trisbase, 89 
mM boric acid, 2 mM EDTA, pa 8). After ethidiumbromide 
staining uncleaved amplification products (about 200 bp; 
ei2© marker wa« run In parallel) were excised from the 
gal and isolated by phenol extraction: an equal volume 
of phenols was added to the excised agarose, which was 
minced to small pieces , frozen for 10 minutes, vortexed 
and centrifuged. The aqueous phase was collected, the 
interphase reextracted by the same volume TE-buffeir, 
centrifuged and both aqueous phases were combined. DNA 
was further purified twice by phenol /chloroform and once 
by chlorofonn/isoamylalcohol extraction. 

1.7 After ethanol precipitation, one fourth or one fifth of 
the isolated DNA was reamplified using the same 
conditions used for the primary amplification axcept for 
diminishing the number of cycles to 13 (cycle 1: 80s ■ 
93°C/40e 52°C/40a 72 Q C; cycles 2-12: 60s 93°C/40s 
S2°C/60s 72°C; cycle 13: COs 93°C/40s 52°C/420s 
72°C). The re<nupli£icalion products weie purified, 
restricted with the same enzymes as above and the 
uncleaved products were isolated from aqarose qels as 
mentioned above for the amplification products. The 
reamplification followed by restriction and gel 
isolation was reppatpd onop.. 

1-6 After the last isolation from the gel, the amplification 
products were digested by 4 unite EcoR I (Pharmacia) for 
2 hours at 37 °C using the buffer recommended by the 
manufacturer- One fourth of the restriction mixture was 
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ligated to the vector pBlucscriptll SX+ (Stratagcnc) 
which was digested likewise by EcoR I- After ligation, 
24 clones from each enzyme combinatiua were further 
analyzed by sequence analysis. The sample restricted by 
AlvN I and Sph I contained no new sequences/ only BMP 6 
and Inhibin 3A sequences. 19 identical new sequences, 
which were named KP-121, were found by the Ava I, AlvN I 
and Tfi I restricted samples. On a sequence differed from 
this mainly-found sequence by two nucleotide exchanges. 
Ligation reaction and transformation in E. coli HB101 
were performed as described in Sambrook et al., 
Molecular cloning: A laboratory manual (1989). 
Transf ormants were selected by Ampicillin resistance and 
the plasmid DNAs were isolated according to standard 
protocols (Sambrook et al. (1989)). Analysis was done by 
sequencing the double-stranded plasmids by 
"dideoxyribonucleotide chain termination sequencing" 
with the sequencing kit "Scqucnase Version 2.0" (United 
States Biochemical Corporation). 

The clone was completed tu the 3' end of the c-DNA by a 
method described in detail by Frohman ( Amplif icat ions , 
published by Perkin-Elmer Corporation, issue 5 (1990)/ 
pp 11-15). The same liver mRNA which was used for the 
isolation of the tirst fragment of MP- 121 vaa reverse 
transcribed using a primer consisting of oligo dT (16 
residues) linked to an adaptor primer 

(AGAA^r-«!ATGCCATGGTCGACGAACC(T) 16 ) . Amplif icat ion was 
performed using the adaptor primer 

(AGAATTCGCATGCCATGGTCGACG) and an internal primer 
(GGCTAC'GCCATGAACTTCTGCATA) of the MP- 121 sequence. The 
amplification products were reamplif ied using a nested 
internal primer ( ACATAGCAGGCATGCCTGGTATTG ) of the MP-121 
sequence and the adaptor primer. The reamplification 
products were cloned after restriction with Sph I in the 
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likewise restricted vector pT7/T3 U19 (Pharmacia) an. 
sequenced with the sequencing kit "Scquepase Version 
2.0" (United States Biochemical Corporation). Clones 
were characterized by theix sequence overlap to the 
end of the known sequence. 



j Bxarnple. 2 

• Isolation of MP- 5 2 

i a further cDNA 5equ«uc:ti, HP-52, was isolated accoxding to the 

above described method (Example 1) by using RNA from human 
embryo (8-9 weeks old) tissue. The pck reaction contained 
! cdna corresponding to 20 ng of poly (A T jrna as starring 
j material. The reamplification step was repeated tvice for 
! both enzyme combinations. After ligation r 24 clones from eaeh 

enzyme combination were further analyzed by sequence 
analysis;- The cample resticted by AlwN I and Sph I yielded a 
new sequence which was named HP-52. The other clones 
comprised mainly BHFS and one BMP 7 sequence. The sample 
restricted by Ava I, AlwN I and Tfi I contained no. new 
| sequencer, but cons>is>ted maiuly* of BMP 7 and <a few Inhibit QA 

! sequences. 

i 

i 

The clone was completed to the 3' end according to the above 
described method (Example 1). The same embryo mRNA, which was 
used for the isolation nf the first fragment of MP-5 7, vas 
reverse transcribed as in Example 1- Amplification was 
I performed using the adaptor primer ( ACAATTCCCATGCCATCGTCCACC) 

j and an internal primer ( CTTCACTACGAGGCTTTCCACTC ) of the HP-52 

' sequence. The amplification products were reamplified using a 

nested adaptor primer (ATTCGCATGCCATGGTCGACGAAG) and a nested 
internal primer (GGAGCCCACGAATCATGCAGTCA) oX the MP -5 2 
sequence. The reamplif ication products were cloned after 

i 

! 

i 
j 

i 

t 
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restriction vith Kco I in a likcvicc restricted vector (pUC 
19 (Pharmacia #27-4951-01) with an altered multiple cloning 
site containing a unique Nco I restriction site) and 
sequenced. Clone* were characterized by their sequence 
overlap to the 3' end of the Jcnovn MP-52 sequence. Some of 
these clones contain the last 143 basepairs of the 3' end of 
the sequence shown in SEQ ID NO: 1 and the 0,56 kb 3' non 
translated region (sequence not shown). One of these was used 
as. a probe to screen a human genomic library (Stratagene 
#9-46203) by a common method described in detail by Ausubel et 
al. (Current Protocols in Molecular Biology , published by 
Greene publishing Associates and Wiley-interscience (19B9)). 
From 8xl0 5 X phages one phage ( X 2.7.4) which was proved to 
contain an insert of about 20 kb, was isolated and. deposited 
by the DSM (#7387). This clone contains in addition to the 
sequence isolated from mRNA by the described amplification 
-methods sequence information further to the 5' and. For 
sequence analysis a Hind III fragment of about 7 r S kb was 
subcloned in a likewise restricted vector (Dluescript SK, 
Stratagene *212206). Thie plasmid, called SKL 52 (H3) MF12, 
was also deposited by the DSM (# 7353). Sequence information 
derived from this clone is nhowa in SEQ ID NO: 1. At 
nucleotide No. 1050, the determined cDNA and the respective 
genomic sequence differ by one basepair (cDNA: G; genomic 
DNA: A) - We assume the genomic sequence to he correct, as it 
was confirmed also by sequencing of the amplified genomic DNA 
from embryonic tissue which had been used for tha mRNA 
preparation. The genomic DNA contains an intron of about 2 kb 
between basfipairs 332 and 333 of SEQ ID NOi 1. The sequence 
of the intron is not shown. The correct exon/exon junction 
was confirmed by sequencing an aiuplif icaLion product derived 
from cDNA which comprieea this region. This sequencing 
information wae obtained by the help of a slightly modified 
method described in detail by Frohman (Amplifications, 
published by Perkin-Elmer Corporation, issue 5 (1990), pp Il- 
ls). The same embryo kna which was used for the is lation of 



NM 663" P. 33/45'. 
PCT/EPW/00350 

the 3 9 end of KP-52 was reverse transcribed using an internal 
primer of the MP-52 sequence oriented in the 5' direction 
(ACAGCAGGTGGGTGGTGTGGACT) . A polyA tail was appended to the 
5' end of the first strand cDNA by using terminal 
transferase. A two step amplification was performed first by 
application of a primer consisting of oligo dT and an adaptor 
primer (AGAATTCCCATCCCATGCTCGACGAAGC(T 1& ) ) and secondly an 
adaptor primer (AGAATTCGCATGCCATGGTCGACG ) and an internal 
primer (CCAGCAGCCCATCCTTCTCC) of the HP-52 sequence. The 
amplification products were reamplified using the same 
adaptor primer and a nested internal primer 
( TCCAGGGCACTAATGTCAAACACG ) of the HP-52 . sequence . 

i 

Consecutively the reamplification product* were again 
reamplified using a nested adaptor primer 
(ATTCGCATGCCATGGTCGACGAAG) and a nested internal primer 
(ACTAATGTCAAACACGTACCTCTGJ of the MP-S2 sequence. The final 
i reamplification products were blunt, end cloned in a vector 
j (Bluescript SX, Stratagene 4212206) restricted with EcoRV. 

Clones were characterized by Lheir sequence overlap to the 
| DNA of X 2.7.4. 

Plasmid SKL 52 (H3) MF12 vas deposited under number 7353 at 
DSM (Deutsche Sammlung von Mikroorganismen und Zellkulturen) , 
Mascheroder Weg lb, 3300 Braunschweig, on 10.12.199?.. 

| Phage X 2*7.4. was deposited under number 7387 at DSM on 

! 13.1. 199 3. 

i 
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SEQ H> BO: 1 

SEQUENCE TYPE: Nucleotide 
SBQPENCE liOGEH: 1207 base pairs 

STOANEEENESS: Axible 
' I QEQEOCTs linear 

TXEE: ZK^ 

ORIGINAL SOURCE: - 
ORGfiNISM: liuman 

IMMEDIATE EXPERIMEWEKL SOURCE: Efctaryo tissue 

PROPERTIES: Sequence coding for hunan TSMS-like protein (MF-52) 

ACDQQGOQGC CCK3ttC0CA AQCCftQSiCA COCTODDCWL ACMG3CM33 OaCfiGOOOG. 60 

'ea L ' imu cc ora A wra c j ujltiujo qs Afloa&agczi irrrff M AaG eao aaiutar 120 

ccocacczcc iiuaucaa , aaflGQoraG g^oooogog azcrrtcac Aooararca. . ibo 

Uujnaouc (racoomcA iggrnn cGaaracRro luuiujltui ' ACMfflcocx 240 

GtTOQGKiacr GBfflGRMGS OfflOGlEftAG rl\3£ g *to.:iQ GCCES3CC&A. 300 

CAOCfiia^C AdLlTiAi ' lVj JCAMGaaCA AGRIG&CQa QSKXCSCG^ TCaOG&aQCA 360 

GftGaE&CSTC TrlGACAl'J^A GlUXCTUoA GMGoAIQ3j cattaU«dtt OOGAGCTOCG 420 

GftlLUL'IUJHG AftGMGGCCT a3GK2033C CftfiffXMOG GD0D00GGBG amjJfJ.-H: 480 

raarftscn; AflGatatEA mmu'AG coocmscfiG cr mnHc cr maror 540 

mJL 'i iX HL is commas xxhocke cictmsto ttcgacaitt ggaagctctt soo 

nfiraMCjrr AAfluvrronR (Ttwxtictg ccroagaG gmscctoss AacoooonG 660 

oxcoroac ci t ^ui^c c i i sw/riuA aTrocm oc OGsaoaiEC ACGafflAGoc 720 

• uaxatouii; Gran-raoac GG&ccraa& aoaaaccic TiemiMBC aohtmggc 780 

OOC rTCJ QQC CflflGaCGMCA AffiCOGZGZA TfflGIMCIG CTCftGQCflGS GGCGftAMCG 840 

QO9GG0O0CA CIGGCCSOC GOOGQQCfA GCCMCCM£ AftffACCTTA AGSCICGCTG 900 

CAGICGSaftG GCACTGCAIG TCAftCITCAft, QSCftlGOX lG33M3Cr GGftlCfllOQC 960 

* A X n TI VSG TBGfflLULTl \faU:i\Sl\&* fflbTIUOAX TQCQCTXCA 1020 
CCTBgOnOC JCGftftlCOTG CfiCnOflGCA GttXClGAlG AXmSBBIC 1080 

CflCflf.T.;wTTr auoultcig raxiarwii GcrraGroec /acRGCfticc lUTivmiiaA 1140 

aciQXAftc ju Muaur AiafloaciA raGacaiG GTOioaor ogigiqgcig 1200 

caaarac 1207 



1 
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sbq TP MO; 7 

S tmjtfJm T2PE: Nucleotide 
SBQUHOE IStClB: 265 has* paixs 

STOfiNDEENESS: Single 

TUFGLOG*: Linear 

KKECULBR CENft. to nRSA 

ottginal sascz: 

OFfflSISM: Human 

M4EDIKIE EXPEHIMENTM, SOURCE: Liver tissue 
FlIOl'JfcKI ' lE S* p uman TGF-Q-liJkfl protein (MP-121) 



60 
120 



caiaaGCcr (sqsqcibcg acATsacrr crocsflKSGG caGiaxoc macamoc 

jCrpflirnpr 633 EKn O J G UClCCTriCA CaCIGCagrO CTCMUCl'll! IC&fltiOCCft& 

cacaono^ csocficcftciG GftGaxocic aigcigxoia onr vrmr <jj. Mxmr r wo 

GTCICKCIC 'IAHATGftCA Q330GCAA CfllTOTCMG flOGBCMaC CTSOlGST 240 
ftGBKSGSX TGTG33TOCA GEEKi 



263 



ciiRCiTlTllTF SHEET 
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SEP ID WQ g^3 

SEQJEXEZ TTZFE; fsaxno acid 
SEQUBCZ IQS2XE; 401 amino acids 

gsoxiiskl soqtce: - 

human 



i 

PROPERTIES: Human protein (HP-52) 



; * FGGEEFKFGB PPOTRQATRR TVHOT3DPG QKRPEK»GSV tfSSZHKOR gQtt'KEPKK 60 

EE^EEPHPH EXMLSnnOCL SQftDRKGGNS b*VKI£BGI«l ' IT I KK in KHQ DDHSWRKQ 120 

; KTVTOIS&IS KDGLDmELK ilRKKPSDIA KPAftPQQGRA AQEXTSSCTS GRQPASLEXJV' 180 
. BSVPGU3GSG WEVEDIWKU 1 KNEMEBOEC IEEEMJERGR AVEOGLffiD RflABQVHEKa/ . 240 

: EFIOTSJIKK RDEEINEIKA BSGUUKLVY EXEF9QERKR RRELKERQC3C KPSKNtiORC 300 

SEKBUSVNEK IM5WDTJWI1A GLCEEPLKSH TEMNSMDFES 360 

TPPTCTVPER LSPISLLFTD SSNNWXKQy ECHWESOGC R 401 
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Ficpttr la 

10 20 30 40 50 

MP 52 CSRKftrflVNF KDM3JKWI AEIZXEfiSBC fcfitlEEKBS HEEETOBAVI 

BMP 2 CKRHPLYVDF SiMSHNDWIV APKHBRFK! BGECPFPT.AD HTMSTOHAIV 

BMP 4 CBRHSISVDF SEWGHNCHTV APPGXQMTC B3DCPEPIAD ELNS1NEAIV 

BMP 5 CKKHFTiYVSF RT£OJQDWn AEBGffiAFTC DQSCSEEItfA EMNftJHHAIV 

BMP 6 CRKBELYVSF QDLO>QDWII AHOSHMttWC DGBCSFEIttA HMNAlKUAIV 



BMP 7 CKKHELYVSF BDD3KQDHH AEBGZRftYYC FT^Tfl FPIMS YMNAINHATV 
; + x» » ± * +** + .j-j. **** • 

j 

I 
i 

I 60 70 80 90 100 

j MP 52 CHMS©PE STPPTCZVPT HTSPISTLFI KOWVESCG a* 

j BMP 2 ■ QTLVNSVN&- KTPKWXVPT ELSAISMLYL DEKE3WVIKN YQDMWBGCG CR 

i 

| BMP 4 CTELWSVNS- SIEKaCCVFT ELSAISMLYL DEYDPCWIOT IQfcMWEQCE CR 

i 
j 

| BMP 5 QELVnUtfTO EVPKFCCAFT KLNAISVLUr' DDSSNVUKK YKNMWRSCG CR 

BMP 6 QTLVHLMNPE YVEKTOCftKr KEJJRISVLXF DDNSNVUXK YRNMWMCG CH 

| 

! BMP 7 QTLVHEENPE TVPKPQCAPT QLNATSVLYF DDSSNVHJOK YHNMWMCC CH 

i 

i 
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Figure lb 



10 20 30 40 50 

HP121 IQFHGYSMNF CXGQCPIHIA MdAASTB SKVEKLLSSN TMGTIGH2 



i InhiiflA lAPSGYHftNT (TEWfaaiA CT9GSSLSEB ST7INHXRHR (3SPESNKS 

| InhilfiB I&PTCTrarc CBSSCPAILA GVPCSftSSTH TAWNJXRMR GUJP-GTVNS 

1 

j Inhiixz VZPPSF1ZHY PHim^HIP PNLSLP VBQBPmQ PVSTT»qgp 

i 

• I + *++ + ** * l I I If + 4+ + »++++++ , + + 

i 

•;' SO 70 80 90 

j 121 cc—vpom ELsmnmRD shivkid-ip twwEficocs 

1" 

I 

\ ' lnhiH3A OC— VPTKLR P&SfUTCOLG CWUKKD-VP HKTTEETGCS 

i 
i 

TnMhfi B OC— IPHCLS XJGMLEFDDE YNIVKRD-VP NMTVEECQCA 

Inhiixz CCftALPHlMR PLHVKTTSK; GKEKSETVP NUJlQBCftCI 
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y.iourg 2a 



OD 

BMP 2 
BMP 3 
UUP 4 
B1P 7 
TGF-G1 

inhibin a 
inhibin 8 A 
inhibin Sg 




Figure 2b 



Eco RI 



OED 
BMP 2 
BMP 3 
BMP 4 
BMP 7 
TQM31 

TGF-B3 
inhibin a 
inhibin b A 
inhibin Q D 
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Claims 

A DNA sequence encoding a protein of the TGF-8 family 
selected from the following group: 

(a) a DNA sequence comprising the nucleotides 
ATGAACTCCATGGACCCCGAGTCCACA 

(b) a DNA sequence comprising the nucleotides 
CTTCTCAAGGCUAACACAGCTGCAGGCACC 

(c) DNA sequences which are degen^rat* as a result of 
the genetic code to the DNA sequences of (a) and 
(b) 

(d) allelic derivatives of the DNA sequences of (a) and 
(b) 

(e) DNA sequences hybridizing to the DNA sequences in 
( a )/ (b), (c) or (d) and encoding a protein 
containing the aminoacid sequence 

Met-Asn-Ser-Met-Asp-Pro-Glu-Ser-Thr 
oV 

Lau-Lcu-Lys-Ala-Asn-Thr-Ala-Ala-Gly-Thr 

(f) DNA sequences hybridizing to the DNA sequences in 
( a ) 9 (c) and (d) and encoding a protein having 
essentially the same biological properties. 
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2 ■ The DBA sequence according to claims 1 which is a 

vertebrate DNA sequence, a mammalian DNA sequence, 
preferably a primate, human , porcine, bovine/ or rodent 
DNA sequence, and preferably including a rat. and a mouse 
DNA sequence. 

3 . The DNA sequence according to claim 1 or 2 which is a 
DNA sequence comprising the nucleotides as shown in SEQ 
'ID NO. 1, 

4. The DNA sequence according to claim 1 or 2 which is a 
DNA sequence comprising the nucleotides as shown in SEQ 
ID NO. 2. 

5. A recombinant DNA molecule comprising, a DNA sequence 
according to any one of claims I to 4'. 

6. The recombinant DNA molecule according .to claim 5 in 
which said DNA sequence Lb functionally linked to an 
expression-control sequence. 

7. a host containing a recombinant DNA molecule according 
to claim -5 or 6 . 



e« The host according to claim 7 which is a barrhfiriiiin, a 
fungus, a plant cell or an animal cell. 

9. A process for the production of a protein of the TGF-fl 
family comprising cultivating a host according to claim 
7 or 8 and recovering said TGF-Q protein from the 
culture > 

10. A protein of the TGF-3 family encoded by a DNA sequence 
according to any one of claims 1 to 4. 



CI IQCTITI ITC CUCCT 



OCT. 1997 - 8:50 



BRE1EJS ROUSSEL 33 0149914610 




r- 



1663 



P. 42/45 



WQ 93/16099 



PC^EP93/(WI350 



* 



-25- 



11. A protein according to claim 10 comprising the amino 
acid sequence of SEQ ID HO: 3. 



TGF-3 family according to claim 10 or 11 f optionally in, 
combination with a pharmaceutical^ acceptable carrier* 

13. The pharmaceutical composition according to claim 12 for 
the treatment of various bone, cartilage or tooth 
defects, and for use in wound and tissue repair 
processes . 

14 . A process for the production of a cDNA fragment 
comprising purifying xnRNA Iruia a tissue, amplifying the 
desired sequences using degenerated related 
oligonucleotides as primers, selecting the desired cDNA 
sequences by digesting undesired amplified cDNA 
sequences using restriction enzymes, amplifying the 
retained CDNA fragments and optionally determining their 
DNA sequence. 

15. An antibody or antibody fragment which is capable of 
specifically binding to a protein of claims 10. or 11. 

16. Antibody or antibody fxagment according to claim 15 
which is <a monoclonal antibody. 



12- A pharmaceutical composition containing a protein of the 



17. 



Use of an antibody or antibody fragment according to 
claims 15 or 16 for diagnostic methods. 
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Figure 1 a 



10 20 30 40 50 

MS! 52 CSRXALHW? KDMGKDDWII ASLEYZAFHC ZGLCEFPLRS HLEPTNHAVI 

BK> 2 CKRHPLYVDc SDVGWNDWIV APPGYHAFYC HGECPFPLAD HLNSTNHAIV 

BMP 4 CRRH3LYVDF SDVGWNDWIV APPGYQAFYC HGDCPFPLAD HLNSTNHAIV 

Blip 5 CKKHZLYVSF RDLGWQDWII APECYAAFYC DGEC5FPLNA HMNATNHAIV 

i 
I 

BliP 6 CRKHELYVSr QDLGWQDWII APKGY&ANYC DGECSFPLNA HMNATNHAIV 

i 

I 
l 

BmIp 7 CKKHELrvsr RDLGWQUWII APEGYAAYYC EGECAFPLMS YMNATNHAIV 

! 
i 

! 50 70 80 90 100 

.MP! 52 UTLMN5WDPE ST?PTCCV?T RLSPISIL7T DSANNWYKQ YEDMWZSCG CP. 

i 

Bli 2 QT-LVNSVNS- KIPKACCVFT ELSAISM1XL OENEKWLKN YQDMWZGCG CR 

i 

BM^ A QTLWSVNS- SIPKACCVPT ELSAISMLVL ' DEYDKWLKN YQEMWEGCG CR 

BMt 5 OTLVHLMrPD HVPKPCCAPT KLNAISVLYF DDSSNVXLKK YRNMWHSCG CH 

BMP 6 QTLVHLMNys YV?KPCCAPT KLNAISVLYF DDNSNVILKK YRNMWRACG CH 

BM^ 7 QTLVHriNFE TVPKPCCAPT QLNAI5VLXF DDS3N V TRNMVVRACG CH 
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Figure lb 



10 20 30 40 50 

HP121 IQPEGYAMNF CIGQCPLHIA GMPGIAASFH TAVLNLLKAN TAAGTTGGGS 

i 

^ahib[5A IAPSGYH&NY CSGSCPSHIA CTSCSSLSrn STVINHYRMR GHSPFANLKS 

^nJlifiPB IAPTG-YGNSf CEGSCPAYLA GVPGSASSFH TAWNQYRMR GLNP-GTVNS 

i 

Iufciba VYFPSFIi'HY CHGGCGLHIP PNT.SLP VPGAPFTPAQ FYSLLFGAQB 

j + * ++ 4 * * *+++++ 4 + + + 444 44 + + 

! 

60 70 80 30 

h?P121 CC — VPTARR FLST.LYYDRD SNIVKTD-IP DMWSACGCS 

• i 

ilnHlbPA CC— VPTKLK PMSMLYYDDG QNIIKKD-IQ NMIVEECGCS 

IhhlbpB CC— IPTKLS TMSMLXTDDE YNIVXRD-V? NMIVEECGCA 

xj*iba CCAALPGTMR PLHVRTTSDG " GrSrXYETVP NLLTQHCACI 

' | ** 4» + J- ++4 ++44 4+ -F* +4+ T 44 *+*4 
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Fjcnire ?^ 



( ? 1663 . P. 45/45 



PCT/EP93/0O3S0 



OD 

BMP 2 

BMP 3 

BMP 4 

BMP 7 

TGT-B1 

TGF-B2 

TGF-B3 

inhibin 

inhibin 

inhibin 



& 



B 



Eea St Me* ) 

ATGAATTCCCATGGACCTGGGCTGGMAKGAMTGGAT 
ACGTGGGGTGGAATGACTGGAT 
ATATTGGCTGGAGTGAATGGAT 
ATGTCGGCTGGAATGACTGGAT 
ACCTGGGCTGGCAGGACTGGAT 
AGGACCTCGGCTGGAAGTCGAT 
GGGATCTAGGGTGGAAATGGAT 
AGGATCTGGGCTGGAAGTGGGT 
ACCTGGGCTGGGAACGGTGGAT 
ACATCCGCTGGAATGACTGGAT 
TCATCGGCTGGAACCACTCGAT 



OID 

BMP 2 

BMP 3 

BMP 4 

BMP 7 

TCF-M 
TGF-JJ2 
TGF-B3 
inhibin a 
inhibin fl. 

,i, A 

inhibin flg 



icoK I 

ATGAATTCGAGCTGCGTSGGSRCACAGCA 
GAGTTCTGTCGGCACACAGCA 
CATCTTTTCTGGTACACAGCA 
CAGTTCAGTGGGCACACAACA 

gagctccgtgggcgcacagl-a 
cagcgcctgcggcacgcagca 
taaarcttgggacacgcagca 
caggtcctggggcacgcagca 
ccctgggagagcagcacagca 
™gcttggtgggcacacagca 

CAGCTTGGTGGGAATCCAGCA 



